152 research outputs found

    A centrality measure for cycles and subgraphs II

    Get PDF
    In a recent work we introduced a measure of importance for groups of vertices in a complex network. This centrality for groups is always between 0 and 1 and induces the eigenvector centrality over vertices. Furthermore, its value over any group is the fraction of all network flows intercepted by this group. Here we provide the rigorous mathematical constructions underpinning these results via a semi-commutative extension of a number theoretic sieve. We then established further relations between the eigenvector centrality and the centrality proposed here, showing that the latter is a proper extension of the former to groups of nodes. We finish by comparing the centrality proposed here with the notion of group-centrality introduced by Everett and Borgatti on two real-world networks: the Wolfe’s dataset and the protein-protein interaction network of the yeast Saccharomyces cerevisiae. In this latter case, we demonstrate that the centrality is able to distinguish protein complexe

    Functional investigation of the coronary artery disease gene SVEP1

    Get PDF
    A missense variant of the sushi, von Willebrand factor type A, EGF and pentraxin domain containing protein 1 (SVEP1) is genome-wide significantly associated with coronary artery disease. The mechanisms how SVEP1 impacts atherosclerosis are not known. We found endothelial cells (EC) and vascular smooth muscle cells to represent the major cellular source of SVEP1 in plaques. Plaques were larger in atherosclerosis-prone Svep1 haploinsufficient (ApoE^{−/−}Svep1^{+/−}) compared to Svep1 wild-type mice (ApoE^{−/−}Svep1^{+/+}) and ApoE^{−/−}Svep1^{+/−} mice displayed elevated plaque neutrophil, Ly6C^{high} monocyte, and macrophage numbers. We assessed how leukocytes accumulated more inside plaques in ApoE^{−/−}Svep1^{+/−} mice and found enhanced leukocyte recruitment from blood into plaques. In vitro, we examined how SVEP1 deficiency promotes leukocyte recruitment and found elevated expression of the leukocyte attractant chemokine (C-X-C motif) ligand 1 (CXCL1) in EC after incubation with missense compared to wild-type SVEP1. Increasing wild-type SVEP1 levels silenced endothelial CXCL1 release. In line, plasma Cxcl1 levels were elevated in ApoE^{−/−}Svep1^{+/−} mice. Our studies reveal an atheroprotective role of SVEP1. Deficiency of wild-type Svep1 increased endothelial CXCL1 expression leading to enhanced recruitment of proinflammatory leukocytes from blood to plaque. Consequently, elevated vascular inflammation resulted in enhanced plaque progression in Svep1 deficiency

    CYGD: the Comprehensive Yeast Genome Database

    Get PDF
    The Comprehensive Yeast Genome Database (CYGD) compiles a comprehensive data resource for information on the cellular functions of the yeast Saccharomyces cerevisiae and related species, chosen as the best understood model organism for eukaryotes. The database serves as a common resource generated by a European consortium, going beyond the provision of sequence information and functional annotations on individual genes and proteins. In addition, it provides information on the physical and functional interactions among proteins as well as other genetic elements. These cellular networks include metabolic and regulatory pathways, signal transduction and transport processes as well as co-regulated gene clusters. As more yeast genomes are published, their annotation becomes greatly facilitated using S.cerevisiae as a reference. CYGD provides a way of exploring related genomes with the aid of the S.cerevisiae genome as a backbone and SIMAP, the Similarity Matrix of Proteins. The comprehensive resource is available under http://mips.gsf.de/genre/proj/yeast/

    Network Archaeology: Uncovering Ancient Networks from Present-day Interactions

    Get PDF
    Often questions arise about old or extinct networks. What proteins interacted in a long-extinct ancestor species of yeast? Who were the central players in the Last.fm social network 3 years ago? Our ability to answer such questions has been limited by the unavailability of past versions of networks. To overcome these limitations, we propose several algorithms for reconstructing a network's history of growth given only the network as it exists today and a generative model by which the network is believed to have evolved. Our likelihood-based method finds a probable previous state of the network by reversing the forward growth model. This approach retains node identities so that the history of individual nodes can be tracked. We apply these algorithms to uncover older, non-extant biological and social networks believed to have grown via several models, including duplication-mutation with complementarity, forest fire, and preferential attachment. Through experiments on both synthetic and real-world data, we find that our algorithms can estimate node arrival times, identify anchor nodes from which new nodes copy links, and can reveal significant features of networks that have long since disappeared.Comment: 16 pages, 10 figure

    Exploiting likely-positive and unlabeled data to improve the identification of protein-protein interaction articles

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Experimentally verified protein-protein interactions (PPI) cannot be easily retrieved by researchers unless they are stored in PPI databases. The curation of such databases can be made faster by ranking newly-published articles' relevance to PPI, a task which we approach here by designing a machine-learning-based PPI classifier. All classifiers require labeled data, and the more labeled data available, the more reliable they become. Although many PPI databases with large numbers of labeled articles are available, incorporating these databases into the base training data may actually reduce classification performance since the supplementary databases may not annotate exactly the same PPI types as the base training data. Our first goal in this paper is to find a method of selecting likely positive data from such supplementary databases. Only extracting likely positive data, however, will bias the classification model unless sufficient negative data is also added. Unfortunately, negative data is very hard to obtain because there are no resources that compile such information. Therefore, our second aim is to select such negative data from unlabeled PubMed data. Thirdly, we explore how to exploit these likely positive and negative data. And lastly, we look at the somewhat unrelated question of which term-weighting scheme is most effective for identifying PPI-related articles.</p> <p>Results</p> <p>To evaluate the performance of our PPI text classifier, we conducted experiments based on the BioCreAtIvE-II IAS dataset. Our results show that adding likely-labeled data generally increases AUC by 3~6%, indicating better ranking ability. Our experiments also show that our newly-proposed term-weighting scheme has the highest AUC among all common weighting schemes. Our final model achieves an F-measure and AUC 2.9% and 5.0% higher than those of the top-ranking system in the IAS challenge.</p> <p>Conclusion</p> <p>Our experiments demonstrate the effectiveness of integrating unlabeled and likely labeled data to augment a PPI text classification system. Our mixed model is suitable for ranking purposes whereas our hierarchical model is better for filtering. In addition, our results indicate that supervised weighting schemes outperform unsupervised ones. Our newly-proposed weighting scheme, TFBRF, which considers documents that do not contain the target word, avoids some of the biases found in traditional weighting schemes. Our experiment results show TFBRF to be the most effective among several other top weighting schemes.</p

    No Origin, No Problem for Yeast DNA Replication

    Get PDF
    Eukaryotic DNA replication initiates from multiple sites on each chromosome called replication origins (origins). In the budding yeast Saccharomyces cerevisiae, origins are defined at discrete sites. Regular spacing and diverse firing characteristics of origins are thought to be required for efficient completion of replication, especially in the presence of replication stress. However, a S. cerevisiae chromosome III harboring multiple origin deletions has been reported to replicate relatively normally, and yet how an origin-deficient chromosome could accomplish successful replication remains unknown. To address this issue, we deleted seven well-characterized origins from chromosome VI, and found that these deletions do not cause gross growth defects even in the presence of replication inhibitors. We demonstrated that the origin deletions do cause a strong decrease in the binding of the origin recognition complex. Unexpectedly, replication profiling of this chromosome showed that DNA replication initiates from non-canonical loci around deleted origins in yeast. These results suggest that replication initiation can be unexpectedly flexible in this organism

    A Novel Framework for the Comparative Analysis of Biological Networks

    Get PDF
    Genome sequencing projects provide nearly complete lists of the individual components present in an organism, but reveal little about how they work together. Follow-up initiatives have deciphered thousands of dynamic and context-dependent interrelationships between gene products that need to be analyzed with novel bioinformatics approaches able to capture their complex emerging properties. Here, we present a novel framework for the alignment and comparative analysis of biological networks of arbitrary topology. Our strategy includes the prediction of likely conserved interactions, based on evolutionary distances, to counter the high number of missing interactions in the current interactome networks, and a fast assessment of the statistical significance of individual alignment solutions, which vastly increases its performance with respect to existing tools. Finally, we illustrate the biological significance of the results through the identification of novel complex components and potential cases of cross-talk between pathways and alternative signaling routes
    corecore